A Method of Inputting Text into a Data Processing Apparatus 



This invention relates to a method of inputting text into a data processing apparatus, e.g. a 
personal computer or a personal digital assistant, and in particular such a method in which 
words are inputted into the data processing apparatus in a logical way from a database. 

Background of the Invention 
Various methods have been devised for inputting text into a data processing apparatus. 
However, in most such methods, each word, phrase or sentence has to be inputted, by typing, 
afresh every time it is required. As more and more words, phrases or sentences are inputted, 
more spelling mistakes will occur. In addition, although there are ways of locating all 
occurrences of the same word, phrase or sentence in a document file in a data processing 
apparatus, e.g. a computer, all such occurrences of the word, phrase or sentence are located in 
isolation, i.e. out of the context in which such are located. 

It is thus an object of the present invention to provide a method of inputting text into such 
an apparatus in which the aforesaid shortcomings are mitigated, or at least to provide a useful 
alternative to the public. 

Summary of the Invention 
According to a first aspect of the present invention, there is provided a method of 
inputting text into a data processing apparatus, including the steps of (a) providing a database 
of words arranged hierarchically into a plurality of levels; (b) displaying at least one window 
on a visual display unit of said data processing apparatus; (c) showing a plurality of said words 
in said at least one window; (d) choosing a word or words from said plurality of words shown 
in said at least one window; and (e) outputting an output in response to choosing of said word(s) 
in said step (d). 

According to a second aspect of the present invention, there is provided a computer 
program configured and executable, when installed in a data processing apparatus, to (a) 
provide in said data processing apparatus a database of words arranged hierarchically into a 
plurality of levels, (b) display at least one window on a visual display unit of said data 
processing apparatus, (c) show a plurality of said words in said at least one window, (d) choose 
a word or words from said plurality of words shown in said at least one window; and (e) output 
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an output in response to choosing of said word(s) in (d) above. 



The present invention is based on the realization that all human languages have a common 
basic structure. For example, for each topic, there is at least a topic description. Such topic 
descriptions relate to:- 



Colour Number 
Width Depth 
Amount Extent 
Date Month 
Severity: mild, moderate, 



Length Height 

Frequency Unit (kg) 

Place Person 

Year Speed 



Weight Diameter 

Unit (cm) Unit (km) 

Activity Time 
Price 



The present invention is also based on a hierarchical arrangement of the topic descriptions. 
For example, in the following example, a "." is used for indicating a sub-level:- 



Example 1 
Cough 
.Day\01 
.Day\02 
.Day\03 
.Day\04 



In the above Example 1 , assuming that a doctor asks a patient for how long the patient has 
been coughing, the patient may then reply by saying one day, two days, etc.. The doctor may 
simply go down one level from "Cough" to the day descriptors, which include "Day\01", 
"Day\02", etc.. Another possible level immediately below "Cough" may be the severity of 
coughing, as in the following Example:- 



Example 2 
Cough 

.Severity\mild 
. Severity\moderate 



.Severity\Severe 
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More complicated versions of the hierarchical arrangement of the levels of topic 
descriptors are shown below in Examples 3 and 4:- 

Example 3 
Orange 

.Amount 

.Number 
.001 
.002 
.003 
.004 

.Box 

.001 
.002 
.003 

.Colour 

.Blue 

.Red 

.Orange 

.Green 

.White 

.Black 
.Sweetness 

.Sweet 

.Sour 

.Bitter 

Example 4 

Transportation 
.Car 
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.Truck 
.Lorry 
.Van 
.Sedan 

.Brand\A 

.Brand\B 

.Brand\C 

.Ship 

.Bicycle 

.Aeroplane 

.Plane 

.Fighter 

This invention is also based on the realization that some words may be grouped into a 
common category in accordance with the significance and/or meaning of the words. For 
example, "cars", "ships" and "aeroplanes" are all means of transportation, and may thus be 
grouped under the common category of "transportation". On the other hand, "pigs", "cows" 
and "goats" may all be grouped under the common category of "animals". Of course, in order 
to be more specific, "pigs", "cows" and "goats" may be classified under the common category 
of "animals, vertebrates". This may be termed a "common root" feature of the words. 

In this way, a hierarchically arranged database or central library with topic descriptors 
arranged into various levels may then be constructed on the basis of the common language 
structure and common root feature discussed above. It can also be seen that under each level, 
there may be more than one immediate sub-level. As in the case of the level "Amount" in 
Example 3, there are two immediate sub-levels, namely "Number" and "Box". 

Such a database and the relevant software may be stored in an optically readable data disc, 
e.g. a compact disc, or other data carrier media, which may then be transferred to and installed 
in a hard disc of a computer or other data processing apparatus for future use. Alternatively, it 
is also possible to download the relevant program into the data processing apparatus from a 
computer network, e.g. the Internet. 
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The database or central library may be tailored made for the specific user. It is generally 
known that, while most people have a common stock of vocabulary, different professions or 
trades have their own respective vocabulary, which are seldom used by other professions or 
trades. For example, lawyers have some terms or vocabulary which are very seldom, if ever, 
used by a doctor or an architect. Different versions of the database may thus be provided, each 
tailored made for a specific profession or trade, or a specific range of professions or trades. 

Brief Description of the Drawings 
An embodiment of the invention will now be more specifically described below and with 
reference to the accompanying drawings, in which:- 

Fig. 1 shows a first display from a visual display unit of a data processing apparatus during use 
of a method and program according to the present invention; 

Fig. 2 shows a second display from the visual display unit in Fig. 1 during use of the method 
and program according to the present invention; 

Fig. 3 shows a third display from the visual display unit in Fig. 1 during use of the method and 
program according to the present invention; 

Fig. 4 shows a fourth display from the visual display unit in Fig. 1 during use of the method and 
program according to the present invention; 

Fig. 5 shows a fifth display from the visual display unit in Fig. 1 during use of the method and 
program according to the present invention; 

Fig. 6 shows a sixth display from the visual display unit in Fig. 1 during use of the method and 
program according to the present invention; 

Fig. 7 shows a seventh display from the visual display unit in Fig. 1 during use of the method 
and program according to the present invention; 

Fig. 8 shows schematically the relationship between a central library of the present invention 
and output mode thereof; 

Fig. 9A shows a first family tree diagram which may be represented in text form according to 
the method of the present invention; and 

Fig. 9B shows a partial second family tree diagram which may be represented in text form 
according to the method of the present invention. 
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Detailed Description of the Preferred Embodiments 
As a first example, the following sentence is to be inputted into the computer:- 

"I go to Woolworld Supermarket 
Buy 6 hen egg which is 
SoldatUSDl/kg" 

Conventionally, such a sentence has to be inputted by typing in alphabet after alphabet, 
and word after word. Errors may be committed in typing the words or numbers. According to 
the present invention, however, such errors may be avoided, or at least reduced. 

A data carrier, e.g. a compact disc, carrying a software according to the present invention 
is first installed into a memory, e.g. a hard disk, of the data processing apparatus, e.g. a 
computer, in which the method is to be carried out. Also optionally installable into the memory 
of the computer is a database of words arranged hierarchically into a plurality of levels. If not 
installed into the memory of the computer, the database may be accessed via a CD-ROM reader 
of the computer into which the compact disc carrying the software is received. 

When a user starts to input the above sentence into the computer, the relevant software 
will be executed to show a display, part of which is as shown in Fig. 1, on a visual display unit 
of the computer. The display is generally divided into an upper window 12 and a lower 
window 14. The relative size of the upper window 12 and the lower window 14 may be varied, 
in accordance with the preference of the user. In this example, the upper window 12 is smaller 
than the lower window 14. 

In the upper window 12, topic descriptors of various levels may be displayed, though not 
necessarily at the same time, available for designation and choice. The topic descriptors 
chosen will be shown in the lower window 14. As in this example, the first top descriptor to be 
entered is the subject of the sentence, which may be such words as "He", "I", "She", "They" 
"We", and "You". The possible choices are, in this example, arranged in alphabetical order in 
the upper window 12. The desired choice, i.e. the word "I" in this example, may then be 
highlighted by an arrow controllable by a mouse of the computer. In particular, clicking a left 
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button of the mouse when the arrow on the screen is on the line "SubjectM" in the upper 
window 12 will highlight the line, as shown in Fig. 1. A further click of the left button of the 
mouse will signify the choosing of this line, and the topic descriptor "SubjectM" will appear in 
the lower window 14, again as shown in Fig. 1, in a manner to be discussed below. 

Alternatively, a quick double-click of the left button of the mouse when the arrow on the 
screen is on the line "SubjectM" will also signify choosing of this line, whereupon the topic 
descriptor "SubjectM" will appear in the lower window 14. As a further alternative, a highlight 
bar may be caused to move upward or downward, by pressing the "T" and keys in a 
keyboard of the computer, from line to line until the required line of topic descriptor is 
highlighted. An "Enter" key on a keyboard of the computer may then be pressed to signify the 
choosing of this line of topic descriptor, whereupon the topic descriptor "SubjectM" will appear 
in the lower window 14. An advantage associated with the use of the "t" and "i" keys is that 
such a method may be used in data processing apparatus of a relatively small size, as only two 
to three keys are required. 

As to the word "go", such is an activity, and is thus grouped under the level "Activity" 
under the level "Action", as shown in Fig. 2. Under the same level "Activity" are also grouped 
such words as "Come" and "Find". It can be seen that before the word "Action" in the upper 
window 12 is a ".", which signifies that such is at a sub-level. 

It should also be noted that, once a certain level descriptor is chosen, the immediately 
below sub-level will be opened by being shown in the upper window 12. Using Example 4 as 
an example, when the level descriptor ".Car" is chosen, the immediately below sub-level 
".Truck", ".Lorry", ".Van", and ".Sedan" will be displayed in the upper window 12 for possible 
designation and choice. Similarly, when the level descriptor ".Sedan" is chosen, the sub-level 
immediately below it, namely ".BrandVA", ".Brand\B" and ".Brand\C" will be displayed in the 
upper window 12 for possible designation and choice, and so on. 

Referring back to the present example, further topic descriptors, such as 
"Supermarket\Woolworld" (see Fig. 3), "Action\CommercialM3uy" (see Fig. 4), 
"Food\Animal\EggMIen" (see Fig. 5), "Price\kg\USD\001.0" (see Fig. 6) and "Number\006" 
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(see Fig. 7) may be inputted in a similar way. When the lower window 14 is in the state as 
shown in Fig. 7, the user may activate the software in the computer to change it into a sentence 
in normal human language form, as follows :- 

"I go to Woolworld Supermarket 
Buy 6 hen egg which is 
SoldatUSDl/kg" 

It can be seen that, in the entry of the above text, the words are not keyed in as in the 
conventional manner. There is thus no risk of mis-typing the words. If it is later found that a 
certain word is mis-spelt in the database, e.g. the word "apple" is mis-spelt as "appel", the user 
may correct the database, and cause all occurrences of the word "appel" to be corrected to 
"apple". Thus each word will be inputted into the computer in the same way, i.e. the words are 
either consistently correct, or consistently wrong. In either case, internal consistency is 
ensured. 

A further feature of the present invention is that it is possible to add new topic descriptors 
into the database as desired by the user. As discussed above, most people have a common stock 
of vocabulary. It is thus possible to provide a basic database which includes this common stock 
of vocabulary. The user is however allowed to expand on this basic database by including new 
words. If, therefore, there is a fourth supermarket called "Perfect Supermarket" not originally 
included in the basic database, as shown in Fig. 3, the user may include this supermarket in the 
database by typing in:- 

". SupermarketVPerfect" 

This new entry will then be stored in this database in the computer, and will be shown next time 
when necessary for possible designation and choice. 

Features are built into this method for enhancing the speed at which the text may be 
inputted into the data processing system, which, in practice, can closely match the speed of 
human speech. Firstly, all the words in the central database or library are indexed according to 
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the conventional manner. Secondly, a hot key feature is included, whereby when the user types 
in the alphabet "b", the words beginning with the alphabet "b" will be shown in the upper 
window 12 in menu form. If the user then types in the alphabet "o", words beginning with the 
alphabets "bo-" will be shown in the upper window 12, and so on, until the user designates a 
certain word in the upper window 12. 

Further, in order to ensure that the inputted text is appropriately categorized, dots (.) are 
used for signifying the level to which a certain descriptor refers. For example, in the following 
input: 

Car 

.Structure\Engine 
..piston 

the word "structure" is one level below "car", and "engine" is also one level below "structure", 
thus the "." and the "\". However, as "piston" is a part of the engine, the word "piston" should 
also be one level below "engine". Thus, in order to ensure that the word "piston" refers to 
"engine", a double dot precedes the word "piston", to signify that it is a sub-sub-level. 
Similarly, triple dots ". . ." and so on may be used. 

By way of the combination of the above features, namely (a) the indexing of the words in 
the central database or library, (b) the hot key feature, and (c) the categorization of the words 
into a hierarchical structure, text can be inputted into the data processing apparatus, e.g. a 
computer, speedily. In addition, as all the text are inputted into the data processing apparatus in 
a relational format, i.e. in a manner which signifies its relationship with the text entered 
therebefore or thereafter, the computer is in a position to discern such interrelationship. Such 
an aggregate text may thus later be acted on by the computer for other purposes. To put it in 
more popular language, it can be said that the text is entered into the data processing apparatus 
in a "logicalised" manner, so that the apparatus is, by reason of the present invention, endowed 
with artificial intelligence (AI), in that it can "understand" the text entered into it, and to act on 
it later. 
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The method discussed above may also be used for providing a textual description of a 
graph. A family tree diagram is shown in Fig. 9A, graphically representing a pair of parents 
having two children, namely a son and a daughter. Using the above method, such may be 
represented in text as: 

Children\Number\02 

Child\01\Male 

Chile\02\Female 

Such a method may be used for providing a textual description of the relation of a boy (marked 
by an arrow) with his elder brother and younger sister, as shown in the partial family tree 
diagram shown in Fig. 9B, as follows: 

Sibling\Number\03 
Sibling\Index\02 
Sibling\01\Male 
Sibling\03\Female 

In this method and the relevant software, each of the topic descriptors is associated with a 
memo field, in which computer commands in text form maybe inputted. The size of the memo 
field is practically indefinite, subject, of course, to the size of the memory of the computer. The 
computer language used for the commands may be FOXPRO, although other appropriate 
computer languages may also be used. The commands inputted into a memo field may be ones 
to direct the computer to display a certain topic descriptor/word in the lower window 14, if the 
topic descriptor/word is shown in the upper window 12 and chosen. It is also possible to 
associate a computer-generated event or command with the choice of a certain topic 
descriptor/word. For example, the memo field associated with the word "doorbell" may 
contain commands instructing the computer to generate and output a "Dong" sound, in which 
case choosing the word "doorbell" will cause a "Dong" sound to be outputted by the computer. 
This output of a "Dong" sound may replace, or be in addition to, the display of the word 
"doorbell" in the lower window 14. With this arrangement, it can be seen that there are 
virtually limitless ways in which the human user may interact with the computer. The choosing 
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of a topic descriptor/word may bring about the occurrence of various events, subject only to the 
design of the user. This can be shown more clearly in Fig. 8. 

As shown in Fig. 8, each choice/word in the central database/library is associated with a 
respective memo field, indicated as "Memo" in Fig. 8, as discussed above. Take Choice 1 as an 
example, Memo 1 associated with it may contain computer commands to display Text 1 on the 
lower window 14 of the visual display unit of the data processing apparatus. As to Choice 2, 
Memo 2 associated with it may contain computer commands for another kind of output, e.g. 
Action 2, which is to play a video file stored in the data processing apparatus. As to Choice 3, 
Memo 3 associated with it may contain computer commands to both display the Text 3 on the 
lower window 14 and output a computer-generated event, Action 3. It can be seen that such 
allows a large variety of choices for the user. In addition, the aggregate output may also form a 
feedback to the relevant memo field(s) to bring about one or more further outputs. 

It is found in practice that the above method is appropriate when used in the context of a 
question and answer dialogue, for example when a doctor seeing a patient. However, this 
method is not limited to this kind of context, and may indeed be used in more diverse 
situations. 

It should be understood that the above only illustrates an example whereby the present 
invention may be carried out, and that various modifications and/or alterations may be made 
thereto without departing from the spirit of the invention. 

It should also be understood that certain features of the invention, which are, for clarity, 
described in the context of separate embodiments, maybe provided in combination in a single 
embodiment. Conversely, various features of the invention which are, for brevity, described in 
the context of a single embodiment, may also be provided separately or in any appropriate 
sub-combinations. 



